amdgpu fixes for rpi5 #6947

pepijndevos · 2025-07-08T11:45:25Z

This cleans up the commit history of the amazing work by @Coreforge to make AMD GPUs work with the Raspberry Pi 5, following instructions by @6by9: geerlingguy/raspberry-pi-pcie-devices#222 (comment) and an explanation of the different parts of the patch: geerlingguy/raspberry-pi-pcie-devices#222 (comment)

So I've made:

- one commit with all the memset changes, which to my understanding could potentially be upstreamed to mainline Linux since they are simply more correct
- one commit with a miscellaneous ttm_uncached change that may need an ifdef so it applies on Arm only
- one commit with all the volatile changes, which are not that invasive but which @Coreforge suggested might also need to be ifdefed for mainline acceptance
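For readers wondering why `memset_io` and `volatile` matter here: as far as I understand it, a plain `memset` or an ordinary pointer leaves the compiler and C library free to pick any store width and alignment, which is fine for RAM but can fault on a PCIe BAR mapped as device memory on arm64. The block below is a userspace sketch of that idea only. It is not the kernel's `memset_io` implementation and not code from these commits, and `io_fill` is a hypothetical name.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for the idea behind memset_io(): fill a
 * device-style mapping using only naturally aligned stores, through
 * volatile pointers so the compiler cannot merge or widen them or
 * replace the loops with a call to memset(). */
static void io_fill(volatile void *dst, uint8_t value, size_t count)
{
    volatile uint8_t *p = dst;

    /* Byte stores until the pointer is 8-byte aligned. */
    while (count > 0 && ((uintptr_t)p & 7u) != 0) {
        *p++ = value;
        count--;
    }

    /* Aligned 64-bit stores for the bulk of the range. */
    uint64_t pattern = 0x0101010101010101ULL * value;
    volatile uint64_t *q = (volatile uint64_t *)p;
    while (count >= 8) {
        *q++ = pattern;
        count -= 8;
    }

    /* Byte stores for whatever is left at the end. */
    p = (volatile uint8_t *)q;
    while (count-- > 0)
        *p++ = value;
}
```

In the real driver the destination would be an `__iomem` pointer and the kernel helper would be used instead. The sketch only shows the access pattern such a helper is meant to guarantee.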

What is not included at the moment is the whole alignment machinery, which to my understanding is more hacky and could be harder to get merged or might require significant changes. I'm not sure how essential that change is, but if desired I could include it as a separate commit as well. Alternatively, the Ampere version of that trap could perhaps be used. FWIW, in limited testing llama.cpp seems to work equally well without that patch applied.

Just to be clear, I don't claim any authorship or even understanding of these changes, and am just trying to grease the wheels of getting these changes upstreamed as far as they will go, making it easier to use GPUs on Raspberry Pi, which I have a big interest in: https://sanctuary-systems.com/sentinel-core/

Coreforge · 2025-07-08T14:21:33Z

The alignment trap isn't needed if all userspace programs respect the alignment requirements. I guess llama.cpp might do that, so it works without it. Xorg I found did need it, even the arm64 build (or a userspace workaround like the memcpy library). The Ampere version should work as well, although I haven't tried it. The Ampere version also covers kernelspace, while mine only covers userspace, so more cards might work with it without extra changes, but at the cost of some performance (whether that would be noticeable or not, I don't know).
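To illustrate what "respecting the alignment requirements" can look like in userspace, here is a minimal sketch of a copy routine that only issues naturally aligned stores to its destination. It rests on the assumption that the destination is a device-memory mapping where unaligned accesses fault. It is neither the memcpy library mentioned above nor the alignment trap, and `aligned_copy_to_io` is a hypothetical name.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: copy from normal RAM into a device-style
 * mapping without ever issuing an unaligned store to the destination. */
static void aligned_copy_to_io(volatile void *dst, const void *src, size_t count)
{
    volatile uint8_t *d = dst;
    const uint8_t *s = src;

    /* Byte stores until the destination is 4-byte aligned. */
    while (count > 0 && ((uintptr_t)d & 3u) != 0) {
        *d++ = *s++;
        count--;
    }

    /* Aligned 32-bit stores; the source is ordinary memory and may be
     * unaligned, so each word is loaded through memcpy(). */
    volatile uint32_t *d32 = (volatile uint32_t *)d;
    while (count >= 4) {
        uint32_t word;
        memcpy(&word, s, sizeof word);
        *d32++ = word;
        s += 4;
        count -= 4;
    }

    /* Byte stores for the remaining tail. */
    d = (volatile uint8_t *)d32;
    while (count-- > 0)
        *d++ = *s++;
}
```

A program that routes all of its writes to GPU memory through something like this would not need the trap. The trap is there for programs, such as Xorg in the observation above, that do not.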

pepijndevos added 3 commits July 8, 2025 13:53

amdgpu: uses memset_io where applicable

3b1c3fa

amdgpu: use ttm_uncached

e770936

amdgpu: mark some variables volatile

72af394

pepijndevos force-pushed the rpi-6.12.y-gpu branch from fcca389 to 72af394 Compare July 8, 2025 11:54

pepijndevos mentioned this pull request Jul 8, 2025

Test GPU (AMD Radeon RX 6700 XT) geerlingguy/raspberry-pi-pcie-devices#222

Open
